Visualising Search Result Sets Using a Force-Based Method to Form Clusters of Similar Documents

نویسنده

  • Susanne Mayr
چکیده

As human knowledge increases, so the volume of electronically available information grows. Finding specific information becomes more difficult and ever more matches are returned in response to a search query. Since quantity is seldom quality, numerous approaches to make sense of search result sets have been proposed. This thesis describes an approach called SearchVis to visualise search result sets, which is based on an approach by Matthew Chalmers described in his 1996 paper ’A Linear Iterative Layout Algorithm for Visualising High-Dimensional Data’. The visualisation concentrates on the similarities between the documents retrieved. An animated, force-based technique produces clusters of similar documents. Through this technique similar documents are attracted and non-similar documents repelled. SearchVis allows the user to adjust the visual discrimination of the clusters using different parameters. It was tested with a varity of test data sets for a wide range of parameter settings. In order to reach as wide an audience as possible, SearchVis was written in Java.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

بررسی نقش انواع بافتار هم‌نویسه‌ها در تعیین شباهت بین مدارک

Aim: Automatic information retrieval is based on the assumption that texts contain content or structural elements that can be used in word sense disambiguation and thereby improving the effectiveness of the results retrieved. Homographs are among the words requiring sense disambiguation. Depending on their roles and positions in texts, homograph contexts could be divided to different types, wit...

متن کامل

Comparison of Strategic Plans of Universities and Institutes of Higher Education with a Quantitative Approach

Strategic planning in Iranian universities and institutes of higher education is generally prepared using strategic planning models introduced by experts and other universities. These programs will be published in the form of university strategic planning documents. These documents have such features that can be similar or different than the programming templates used. Existence of the similar...

متن کامل

Text Summarization Using Cuckoo Search Optimization Algorithm

Today, with rapid growth of the World Wide Web and creation of Internet sites and online text resources, text summarization issue is highly attended by various researchers. Extractive-based text summarization is an important summarization method which is included of selecting the top representative sentences from the input document. When, we are facing into large data volume documents, the extr...

متن کامل

Coronavirus: Discover the Structure of Global Knowledge, Hidden Patterns & Emerging Events

Background & Objective:  The present study aimed at exploring the structure of global knowledge, hidden patterns, and emerging Coronavirus events using co-word techniques. Co-word analysis is one of the most efficient scientific methods to analyze the structure and dynamics of knowledge and the general state of research.  Materials & Methods:  This applied research performed using Co-word anal...

متن کامل

Clustering multilingual documents by estimating text - to - text semantic relatedness

This thesis is about multilingual document clustering through estimating semantic relatedness between multilingual texts. Specifically we focus on the task of clustering multilingual documents with very limited or no supervisory information. We present two approaches to address the problem : a comparable-corpora based approach and a web-searches based approach. Our first approach derives pairwi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997